Balls into Bins: strict Capacities and Edge Weights
نویسنده
چکیده
We explore a novel theoretical model for studying the performance of distributed storage management systems where the data-centers have limited capacities (as compared to storage space requested by the users). Prior schemes such as Balls-into-bins (used for load balancing) neither consider bin (consumer) capacities (multiple balls into a bin) nor the future performance of the system after, balls (producer requests) are allocated to bins and restrict number of balls as a function of the number of bins. Our problem consists of finding an optimal assignment of the online producer requests to consumers (via weighted edges) in a complete bipartite graph while ensuring that the total size of request assigned on a consumer is limited by its capacity. The metric used to measure the performance in this model is the (minimization of) weighted sum of the requests assigned on the edges (loads) and their corresponding weights. We first explore the optimal offline algorithms followed by the analysis of different online techniques (by comparing their performance against the optimal offline solution). LP and Primal-Dual algorithms are used for calculating the optimal offline solution in O(r · n) time (where r and n are the number of requests and consumers respectively) while randomized algorithms are used for the online case. We propose randomized online algorithms in which the consumers are selected based on edge probabilities (that can change with consumer failures; due to capacity exhaustion) and evaluate the performance of these randomized schemes using probabilistic analysis. The performance of the online algorithms is measured using competitive analysis assuming an oblivious adversary who knows the randomized algorithm but not the results produced. For the simplified model with equal consumer capacities an average-case competitive ratio (which compares the average cost of the output produced by the online algorithm and the minimum cost of the optimal offline solution) of ( d mini, j di, j ) (where d is the edge weight / distance) is achieved using an algorithm that has equal probability for selecting any of the available edges with a running time of O(r). In the extending the model to arbitrary consumer capacities we show an average case competitive ratio of ( d·c c·mini, j di, j ) . This theoretical model gives insights to a (storage) cloud system designer about, how the different attributes (producer requests, edge weights and consumer capacities) effect the overall (read / write) performance of a distributed storage management system over a period of time.
منابع مشابه
On Weighted Balls-into-Bins Games
We consider the well-known problem of randomly allocating m balls into n bins. We investigate various properties of single-choice games as well as multiple-choice games in the context of weighted balls. We are particularly interested in questions that are concerned with the distribution of ball weights, and the order in which balls are allocated. Do any of these parameters influence the maximum...
متن کاملFractional Matching Via Balls-and-Bins
In this paper we relate the problem of finding structures related to perfect matchings in bipartite graphs to a stochastic process similar to throwing balls into bins. Given a bipartite graph with n nodes on each side, we view each node on the left as having balls that it can throw into nodes on the right (bins) to which it is adjacent. If each node on the left throws exactly one ball and each ...
متن کاملBalanced offline allocation of weighted balls into bins
We propose a sorting-based greedy algorithm called SortedGreedy[m] for approximately solving the offline version of the d-choice weighted balls-into-bins problem where the number of choices for each ball is equal to the number of bins. We assume the ball weights to be nonnegative. We compare the performance of the sorting-based algorithm with a näıve algorithm called Greedy[m]. We show that by ...
متن کاملPerfectly Balanced Allocation With Estimated Average Using Expected Constant Retries
Balanced allocation of online balls-into-bins has long been an active area of research for efficient load balancing and hashing applications. There exists a large number of results in this domain for different settings, such as parallel allocations [1], multi-dimensional allocations [5], weighted balls [4] etc. For sequential multi-choice allocation, where m balls are thrown into n bins with ea...
متن کاملBalanced Allocations: Balls-into-Bins Revisited and Chains-into-Bins
The study of balls-into-bins games or occupancy problems has a long history since these processes can be used to translate realistic problems into mathematical ones in a natural way. In general, the goal of a balls-into-bins game is to allocate a set of independent objects (tasks, jobs, balls) to a set of resources (servers, bins, urns) and, thereby, to minimize the maximum load. In this paper ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1206.3634 شماره
صفحات -
تاریخ انتشار 2012